Approximation Rates for Shallow ReLU$^k$ Neural Networks on Sobolev Spaces via the Radon Transform

Abstract

Let $\Omega\subset \mathbb{R}^d$ be a bounded domain. We consider the problemof how efficiently shallow neural networks with the ReLU$^k$ activationfunction can approximate functions from Sobolev spaces $W^s(L_p(\Omega))$ witherror measured in the $L_q(\Omega)$-norm. Utilizing the Radon transform andrecent results from discrepancy theory, we provide a simple proof of nearlyoptimal approximation rates in a variety of cases, including when $q\leq p$,$p\geq 2$, and $s \leq k + (d+1)/2$. The rates we derive are optimal up tologarithmic factors, and significantly generalize existing results. Aninteresting consequence is that the adaptivity of shallow ReLU$^k$ neuralnetworks enables them to obtain optimal approximation rates for smoothness upto order $s = k + (d+1)/2$, even though they represent piecewise polynomials offixed degree $k$.

Quick Read (beta)

loading the full paper ...